Approximation, Estimation and Control of Stochastic Systems Under a Randomized Discounted Cost Criterion
نویسندگان
چکیده
The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate, and possibly unbounded costs. The state process {xt} and the discount process {αt} evolve according to the coupled difference equations xt+1 = F (xt, αt, at, ξt), αt+1 = G(αt, ηt) where the state and discount disturbance processes {ξt} and {ηt} are sequences of i.i.d. random variables with densities ρ and ρ respectively. The main objective is to introduce approximation algorithms of the optimal cost function that lead up to construction of optimal or nearly optimal policies in the cases when the densities ρ and ρ are either known or unknown. In the latter case, we combine suitable estimation methods with control procedures to construct an asymptotically discounted optimal policy.
منابع مشابه
Optimization of the Inflationary Inventory Control Model under Stochastic Conditions with Simpson Approximation: Particle Swarm Optimization Approach
In this study, we considered an inflationary inventory control model under non-deterministic conditions. We assumed the inflation rate as a normal distribution, with any arbitrary probability density function (pdf). The objective function was to minimize the total discount cost of the inventory system. We used two methods to solve this problem. One was the classic numerical approach which turne...
متن کاملApproximation and estimation in Markov control processes under a discounted criterion
We consider a class of discrete-time Markov control processes with Borel state and action spaces, and <−valued i.i.d. disturbances with unknown density ρ. Supposing possibly unbounded costs, and under standard continuity and compactness conditions, we combine suitable density estimation methods of ρ with approximation procedures of the optimal cost function, to construct asymptotically discount...
متن کاملAn Approximation Framework for Infinite Horizon Stochastic Dynamic Optimization Problems with Discounted Costs
Traditional approaches to solving stochastic optimal control problems involve dynamic programming, and solving certain optimality equations. When recast as stochastic programming problems, structural aspects such as convexity are regained, and solution procedures based on decomposition and duality may be exploited. This paper explores a class of stationary, infinite-horizon stochastic optimizat...
متن کاملAn actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
We develop in this article the first actor–critic reinforcement learning algorithm with function approximation for a problem of control under multiple inequality constraints. We consider the infinite horizon discounted cost framework in which both the objective and the constraint functions are suitable expected policy-dependent discounted sums of certain sample path functions. We apply the Lagr...
متن کاملA Robust Distributed Estimation Algorithm under Alpha-Stable Noise Condition
Robust adaptive estimation of unknown parameter has been an important issue in recent years for reliable operation in the distributed networks. The conventional adaptive estimation algorithms that rely on mean square error (MSE) criterion exhibit good performance in the presence of Gaussian noise, but their performance drastically decreases under impulsive noise. In this paper, we propose a rob...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Kybernetika
دوره 45 شماره
صفحات -
تاریخ انتشار 2009